PLASMIDS AND METHODS FOR MONITORING
ENDONUCLEASE DIGESTION EFFICIENCY 



FIELD OF THE INVENTION 

[0001] The invention relates to plasmids, either in circular or linear form, useful in 
monitoring the efficiency of a restriction endonuclease digestion reaction. 

BACKGROUND OF THE INVENTION 

[0002] Restriction endonucleases are a class of enzymes that occur naturally in 
bacteria and when they are purified away from other contaminating bacterial 
components, they can be used in the laboratory to break DNA molecules into 
precise fragments. They are the biochemical scissors by which genetic 
engineering and analysis are performed and, therefore, have proved to be
indispensable tools in modern genetic research. 

[0003] Restriction endonucleases act by recognizing and binding to particular 
sequences of nucleotides (the "recognition sequence") along the DNA molecule. 
Once bound, they cleave the molecule within, or to one side of, the sequence. 
Different restriction endonucleases have affinity for different recognition 
sequences. Close to one hundred different restriction endonucleases have been 
identified among the many hundreds of bacterial species that have been examined
to date. 

[0004] Plasmid construction is an essential technique in the cloning and
manipulation of genes. Most plasmid construction strategies are designed to 
ensure that insertion of a DNA fragment into a vector predominates over the 
default event in which the vector recircularizes without incorporating an insert.
There are four major ways of achieving high insertion ratios. They all rely, in one
way or another, on rendering the vector incapable of circularizing without ligating 
to the insert fragment. For example, use of phosphatased vectors favors a high 
insertion frequency because dephosphorylated ends can only ligate to 
phosphorylated ends supplied by the insert. In utilizing this method, however,
the phosphatase is difficult to control. TA vectors, used for cloning PCR-
amplified DNA, are designed to be linearized in such a way as to leave single 3'
thymidine overhangs, which can ligate only to adenine overhangs like those left
on the 3' ends of DNA extended by Taq polymerase (in this method ligation is
catalyzed either by DNA ligase or DNA topoisomerase). The TA cloning method also
favors efficient insertion of the PCR product. One limitation of this method is that
not all thermostable DNA polymerases used for PCR leave the 3' adenine. Partial
fill-in of most 5' restriction site overhangs by Klenow polymerase renders them
non-circularizable; an insertable fragment is then generated by partial fill-in of an
appropriate restriction site (e.g., a vector cleaved with SalI (G↓TCGAC) and filled in
with T and C can be ligated to Sau3AI (↓GATC) fragments filled in with G and
A).

[0005] The fourth widely used technique is referred to as "forced ligation." This 
technique makes use of a vector digested with two different restriction enzymes to 
generate a non-circularizable linear molecule; an insert fragment generated with 
the same enzymes is ligated in, thereby closing the plasmid. Forced ligation of 
gel-purified restriction fragments is the simplest and most efficient method of
plasmid construction in situations where segments are transferred from one 
plasmid to another, or connected together in new combinations, e.g. in 
mutagenesis, in analysis of gene expression, and in numerous other applications. 
Double-digested vectors are also convenient for cloning PCR fragments into a 
construct directly, without the necessity of a TA cloning step, provided that the 
PCR primers are designed with appropriate restriction sites.

[0006] The forced ligation method provides a high percentage of insertion (often
greater than 90%), provided that both of the vector sites are fully digested. This
criterion is not always easy to meet, since many enzymes are of uncertain
stability or are inhibited by contaminants present in DNA preparations. Hence,
demonstration of good digestion is highly desirable before proceeding with the
construction. This is also problematic, since complete digestion is impossible to
document by agarose electrophoresis when the two sites are in a polylinker, and
even when the sites are separated by as much as 10% of the full length of the
vector, the desired double-cut linear may not be fully resolved from contaminating
partially digested molecules. Even a barely detectable residue of partially digested
vector leads to an inordinately large number of re-circularized plasmids, requiring 
a large number of minipreps to be analyzed in search of the desired construct. 
This is especially troublesome when available insert fragments are present in 
limited amounts or when lower frequency events, such as triple ligations, are 
desired. 

[0007] Because of the uncertainties in obtaining reliable double digests, marker 
plasmids can be included in mixed test digestions to be sure that both enzymes are 
active and that neither is inhibited by contaminants in the vector plasmid prep or 
in a PCR product to be inserted. Suitable marker plasmids can sometimes be 
found in a plasmid collection for a given pair of enzymes but often none is 
available, limiting the value of the digest marker maneuver, especially in labs with 
small plasmid inventories. What is needed is a double digest marker plasmid or 
set of plasmids which is informative of digestion efficiency for a variety of known 
restriction endonucleases.

[0008] In one aspect, the invention relates to a plasmid for use in monitoring the
efficiency of a restriction endonuclease digestion, comprising at least one spacer 
segment comprising a nucleic acid sequence that is restriction site-free and at least 
two polylinker regions containing a plurality of unique restriction sites. The 
restriction sites are distributed so that digestion of the plasmid with two restriction 
endonucleases whose sites are represented on the plasmid results in two 
fragments, each of the fragments being sufficiently different in size from the intact 
plasmid so as to be readily distinguishable from the plasmid. Generally, fragments 
that are at least about 15% shorter than the intact plasmid can easily be 
distinguished from the intact plasmid when visualized on an agarose gel. 

[0009] In another aspect, the invention relates to a plasmid for monitoring the 
digestion efficiency of a restriction endonuclease digestion comprising at least one 
spacer segment comprising a nucleic acid sequence that is restriction site-free and 
at least two polylinker regions where the polylinker regions contain a plurality of 
unique restriction sites distributed so that, for any two sites, the two sites are 
situated within different polylinker regions on at least one plasmid of a set of such 
plasmids. The polylinker regions are separated by a spacer region of restriction 
site-free nucleic acid whose length is about 15-85% of the length of the plasmid. 
Alternatively, the length of the spacer region of restriction site-free nucleic acid 
may be 20-85%, 30-85%, 40-85% or 50-85% of the length of the intact plasmid. 
Digestion of the plasmid with two endonucleases whose recognition sites are 
represented on the plasmid, therefore, results in two fragments, one of the 
fragments being at least about 15%-85% of the length of the intact plasmid. The
plasmid of the present invention further comprises a replication origin and a 
selectable marker, for example, the ampR gene. In one embodiment, the plasmid
of the present invention comprises a vector backbone of a plasmid such as pUC,
pBR322 or pBS. 

[0010] In a related aspect, the invention relates to a set of plasmids for use in 
monitoring the efficiency of a restriction endonuclease digestion, wherein each of 
the plasmids of the set comprises at least one spacer segment comprising a nucleic 
acid sequence that is restriction site-free and at least two polylinker regions 
containing a plurality of unique restriction sites distributed so that, for any two 
sites, the two sites are situated within different polylinker regions on at least one 
plasmid of the set and the polylinker regions are separated by a spacer segment 
whose length is about 15-85% of the length of the plasmid. 

[0011] In yet another aspect, the invention relates to a method for designing a set 
of plasmids for use in monitoring the efficiency of a restriction endonuclease 
digestion comprising: (a) identifying at least one spacer segment comprising a 
nucleic acid sequence that is restriction site-free; (b) identifying a plurality of 
restriction sites to be represented on the plasmids; (c) assigning each restriction 
site to a polylinker region on one of the plasmids such that for any two restriction 
sites there is at least one polylinker in the set that does not contain both sites; (d) 
distributing the polylinker regions on the plasmids such that the polylinker regions 
are separated by a spacer segment at least about 15%-85% of the length of the 
plasmid. 

[0012] In still another aspect, the invention relates to a method for designing a set 
of plasmids for monitoring the efficiency of a restriction endonuclease digestion 
comprising: (a) identifying at least one spacer segment comprising a nucleic acid 
sequence that is restriction site-free; (b) identifying a plurality of restriction sites 
to be represented on the plasmids; (c) determining the number of plasmids (b) and
the number of polylinker regions (a) on each plasmid necessary to accommodate
the desired restriction sites, wherein the maximum number (N) of sites which can
be represented is N = a^b, where a is the number of polylinkers in each plasmid and
b is the number of plasmids in the set; (d) assigning each of the restriction sites to
a polylinker region in accordance with a template, wherein the template 
corresponds to an a x b matrix, and where each site is in a different polylinker 
from any other site in at least one of the plasmids in the set. 
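
The capacity relation in step (c) can be illustrated with a short calculation. The following is only a sketch: the function names `max_sites` and `min_plasmids` are hypothetical and are introduced here for illustration, not taken from the specification.

```python
def max_sites(a: int, b: int) -> int:
    """Maximum number of fully distributed sites, N = a**b.

    a: number of polylinkers per plasmid, b: number of plasmids in the set.
    """
    if a < 2 or b < 1:
        raise ValueError("need at least two polylinkers and one plasmid")
    return a ** b


def min_plasmids(n_sites: int, a: int) -> int:
    """Smallest number of plasmids b for which a**b >= n_sites."""
    if a < 2 or n_sites < 1:
        raise ValueError("need at least two polylinkers and one site")
    b = 1
    while a ** b < n_sites:
        b += 1
    return b


if __name__ == "__main__":
    # Designs mentioned in the text: (polylinkers per plasmid, plasmids).
    for a, b in [(4, 2), (3, 3), (4, 3)]:
        print(f"{a} polylinkers x {b} plasmids: up to {max_sites(a, b)} sites")
```

Under these assumptions, a list of 27 sites fits on three plasmids with three polylinkers each, while a 28th site would require either a fourth plasmid or a fourth polylinker.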

[0013] In one aspect, the invention relates to a method of designing a set of three 
plasmids for monitoring the efficiency of a restriction endonuclease digestion 
comprising (a) identifying at least two spacer segments comprising a nucleic acid 
sequence that is restriction site-free; (b) identifying 27 restriction sites to be 
represented on the three plasmids; (c) numerically ordering the 27 restriction 
sites; and (d) assigning each of the restriction sites to a polylinker region as 
follows: 

(i) sites 1-9 are assigned to a first polylinker on a first plasmid, sites 
10-18 are assigned to a second polylinker on the first plasmid and sites 19- 
27 are assigned to a third polylinker on the first plasmid; 

(ii) sites 1, 4, 7, 10, 13, 16, 19, 22 and 25 are assigned to a first 
polylinker on a second plasmid, sites 2, 5, 8, 11, 14, 17, 20, 23, and 26 are
assigned to a second polylinker on the second plasmid and sites 3, 6, 9, 12, 
15, 18, 21, 24, and 27 are assigned to a third polylinker on the second 
plasmid; 






(iii) sites 1, 2, 3, 10, 11, 12, 19, 20 and 21 are assigned to a first 
polylinker on a third plasmid, sites 4, 5, 6, 13, 14, 15, 22, 23 and 24 are 
assigned to a second polylinker on the third plasmid and sites 7, 8, 9, 16, 
17, 18, 25, 26 and 27 are assigned to a third polylinker on the third
plasmid. 

[0014] The polylinker regions are then distributed on each of the plasmids such 
that the polylinker regions are separated from each other by a spacer segment at 
least about 15% of the length of the intact plasmid. 
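
The assignment in steps (i) to (iii) can be checked by brute force: for every pair of the 27 sites, at least one plasmid should carry the two sites in different polylinkers. The sketch below copies the site lists from the text; the helper names `polylinker_of` and `unresolved_pairs` are hypothetical.

```python
from itertools import combinations

# Polylinker contents copied from steps (i)-(iii); each inner list is one polylinker.
PLASMIDS = [
    [list(range(1, 10)), list(range(10, 19)), list(range(19, 28))],
    [[1, 4, 7, 10, 13, 16, 19, 22, 25],
     [2, 5, 8, 11, 14, 17, 20, 23, 26],
     [3, 6, 9, 12, 15, 18, 21, 24, 27]],
    [[1, 2, 3, 10, 11, 12, 19, 20, 21],
     [4, 5, 6, 13, 14, 15, 22, 23, 24],
     [7, 8, 9, 16, 17, 18, 25, 26, 27]],
]


def polylinker_of(plasmid, site):
    """Return the index of the polylinker that carries `site` on `plasmid`."""
    for index, polylinker in enumerate(plasmid):
        if site in polylinker:
            return index
    raise KeyError(site)


def unresolved_pairs(plasmids, n_sites):
    """Pairs of sites that share a polylinker on every plasmid of the set."""
    return [
        (s, t)
        for s, t in combinations(range(1, n_sites + 1), 2)
        if all(polylinker_of(p, s) == polylinker_of(p, t) for p in plasmids)
    ]


if __name__ == "__main__":
    print("unresolved with all three plasmids:", len(unresolved_pairs(PLASMIDS, 27)))
    print("unresolved with the first two only:", len(unresolved_pairs(PLASMIDS[:2], 27)))
```

With all three plasmids no pair remains unresolved; dropping the third plasmid leaves 27 pairs that never fall in different polylinkers, which is why the third grouping is needed.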

[0015] In a related aspect, the invention relates to a set of plasmids designed and 
constructed according to this method. The plasmids so constructed may further 
comprise additional restriction sites situated in non-polylinker regions of the
plasmids. 

[0016] In yet another aspect, the invention relates to a method of constructing a set 
of three plasmids for monitoring the efficiency of a restriction endonuclease 
digestion comprising: 

(a) identifying at least one spacer segment comprising a nucleic acid 
sequence that is restriction site-free; (b) identifying 64 restriction sites to be 
represented on the three plasmids; (c) numerically ordering said restriction sites; 
and (d) assigning each of said restriction sites to a polylinker region as follows: 

(i) sites 1-16 are assigned to a first polylinker on a first 
plasmid, sites 17-32 are assigned to a second polylinker on the first
plasmid, sites 33-48 are assigned to a third polylinker on the first plasmid
and sites 49-64 are assigned to a fourth polylinker on the first plasmid; 

(ii) sites 1-4, 29-32, 41-44 and 53-56 are assigned to a first
polylinker on a second plasmid, sites 5-8, 17-20, 45-48 and 57-60 are
assigned to a second polylinker on the second plasmid, sites 9-12, 21-24,
33-36 and 61-64 are assigned to a third polylinker on the second plasmid 
and sites 13-16, 25-28, 37-40 and 49-52 are assigned to a fourth polylinker 
on the second plasmid; 

(iii) every fourth site beginning with site number 1 is assigned to 
a first polylinker on a third plasmid, every fourth site beginning with site 
number 2 is assigned to a second polylinker on the third plasmid, every 
fourth site beginning with site number 3 is assigned to a third polylinker on 
the third plasmid; and every fourth site beginning with site number 4 is 
assigned to a fourth polylinker on the third plasmid. The polylinker regions 
are then distributed on each of the plasmids such that they are separated by 
a spacer segment at least about 15% of the length of the plasmid. 
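
The role of the second plasmid in this 64-site scheme can be seen by counting the site pairs that remain unresolved with and without it. This is a sketch only; the site ranges are copied from steps (i) to (iii), and the helper names `expand`, `shares_polylinker` and `count_unresolved` are hypothetical.

```python
from itertools import combinations


def expand(*ranges):
    """Expand inclusive (first, last) ranges into a set of site numbers."""
    return {n for first, last in ranges for n in range(first, last + 1)}


# Step (i): four blocks of 16 consecutive sites.
PLASMID_1 = [expand((1, 16)), expand((17, 32)), expand((33, 48)), expand((49, 64))]
# Step (ii): blocks of four sites, shifted from one block of 16 to the next.
PLASMID_2 = [
    expand((1, 4), (29, 32), (41, 44), (53, 56)),
    expand((5, 8), (17, 20), (45, 48), (57, 60)),
    expand((9, 12), (21, 24), (33, 36), (61, 64)),
    expand((13, 16), (25, 28), (37, 40), (49, 52)),
]
# Step (iii): every fourth site, starting at 1, 2, 3 and 4.
PLASMID_3 = [set(range(start, 65, 4)) for start in (1, 2, 3, 4)]


def shares_polylinker(plasmid, s, t):
    """True if sites s and t lie in the same polylinker of `plasmid`."""
    return any(s in polylinker and t in polylinker for polylinker in plasmid)


def count_unresolved(plasmids):
    """Number of site pairs that share a polylinker on every given plasmid."""
    return sum(
        1
        for s, t in combinations(range(1, 65), 2)
        if all(shares_polylinker(p, s, t) for p in plasmids)
    )


if __name__ == "__main__":
    print("plasmids 1 and 3 only:", count_unresolved([PLASMID_1, PLASMID_3]))
    print("all three plasmids:  ", count_unresolved([PLASMID_1, PLASMID_2, PLASMID_3]))
```

Plasmids 1 and 3 alone leave 96 pairs that are never separated; the grouping of step (ii) separates every one of them, so the full set resolves all pairs.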

[0017] In a related aspect, the invention relates to a set of three plasmids 
constructed according to this method and includes plasmids in which additional 
restriction sites have been placed in non-polylinker regions of the plasmid. 

[0018] In yet another aspect, the invention relates to a method for preparing 
plasmids for monitoring the digestion efficiency of a restriction endonuclease 
digest reaction, comprising the steps of: (a) transfecting host cells with the 
monitor plasmid described above; (b) growing the cells under conditions to
provide a quantity of cells containing the plasmids; and (c) purifying the plasmids
from the cells. 

[0019] In still another aspect, the invention relates to a method for monitoring the 
digestion efficiency of a restriction endonuclease digestion reaction comprising: 
(a) adding a monitor plasmid of the type described above to a sample preparation 
of a non-monitor plasmid; (b) initiating an endonuclease digestion reaction; (c) 
allowing the reaction to proceed to completion; and (d) determining, usually by 
gel electrophoresis of the digested material, whether the monitor plasmid has been 
digested, wherein digestion of the monitor plasmid indicates digestion of non- 
monitor plasmids in the sample preparation. 

[0020] In yet another aspect, the invention relates to a kit for monitoring the 
digestion efficiency of a restriction endonuclease digestion reaction comprising at 
least one plasmid where the plasmid contains a plurality of unique restriction sites 
distributed so that digestion of the plasmid with any two restriction endonucleases 
represented on the plasmid results in two plasmid fragments, one of the fragments 
being at least about 15% of the length of the undigested plasmid. The kit may 
comprise one, two, three, four or more plasmids, as required, and includes
instructions for use of the plasmid(s). The kit may additionally include restriction
endonucleases whose sites are represented on the plasmid(s) and appropriate
buffers.

BRIEF DESCRIPTION OF THE DRAWINGS

[0021] FIGURE 1 is a schematic diagram of a set of four plasmids in accordance 
with one embodiment of the present invention. 

[0022] FIGURE 2 is a schematic diagram of a set of four plasmids in accordance 
with another embodiment of the present invention. 

[0023] FIGURE 3 is a photograph of an ethidium bromide stained 1% agarose 
gel. 

[0024] FIGURE 4 is an example of a chart to be included with instructions for 
use of digestion monitor plasmids indicating the appropriate plasmid of a set of 
plasmids to be used to monitor the digestion efficiency of any two endonucleases 
represented on the plasmids. 

DETAILED DESCRIPTION OF THE INVENTION 

[0025] In practicing the present invention, many conventional techniques in 
molecular biology and recombinant DNA technology are used. Such techniques 
are well known and are explained fully in, for example, Sambrook et al., 2000, 
Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York; Current Protocols in 
Molecular Biology, Volumes I, II, and III, 1997 (F.M. Ausubel, ed.).

[0026] In the description that follows, certain conventions will be followed as
regards the usage of terminology: 

[0027] The term "plasmid" refers to an autonomously replicating,
extrachromosomal circular nucleic acid molecule, distinct from the normal host
genome and nonessential for cell survival under nonselective conditions. 
Generally, artificially constructed plasmids are used as cloning vectors. The term, 
as used herein, refers to a construction of genetic material designed to monitor the 
efficiency of a restriction endonuclease digestion, in addition to being useful as a
cloning vector. Furthermore, the term is meant to encompass intact plasmids in 
either a circular or linear configuration as well as nucleic acids derived from a 
plasmid vector. The plasmid may be capable of autonomous replication in a 
defined host organism such that the plasmid can be reproduced. 

[0028] The term "intact plasmid" implies that no genetic material has been lost 
from the plasmid. An "intact plasmid," therefore, is intended to encompass the 
full-length linear form of the plasmid that is produced by digestion with a single 
endonuclease, as well as circular plasmids. 

[0029] The term "restriction site," as it is known in the art, refers to a nucleotide 
sequence composed typically of 4, 6, or 8 nucleotides that is recognized by a 
restriction endonuclease.

[0030] The term "polylinker" or "polylinker region" refers to a short segment of
nucleic acid having several unique restriction endonuclease recognition sites 
therein. Generally, the polylinker allows the cloning of other nucleic acid 
sequences into the plasmid backbone. 

[0031] The term "non-polylinker region," therefore, refers to sections of the
plasmid other than the polylinker regions. This term is intended to encompass 
restriction site-free spacer segments, vector backbone segments and the like. 

[0032] The term "spacer segment" refers to the restriction site-free segments of 
DNA between the polylinkers.

[0033] The term "vector backbone" refers to a portion of a plasmid derived from a 
common vector, for example, pUC19. The vector backbone usually comprises an 
origin of replication and a selectable marker, for example, a gene for antibiotic 
resistance. 

[0034] The term "restriction site-free" refers to a segment of nucleic acid in any 
particular plasmid, that contains none of the restriction endonuclease recognition 
sites represented in the polylinker regions of that plasmid. This restriction site- 
free region, therefore, remains intact when exposed to those restriction 
endonucleases represented in the polylinker regions but not necessarily when 
exposed to endonucleases not represented in the polylinker regions. The term, as 
used herein, is not meant to imply that the restriction site-free region is totally 
devoid of any restriction sites. The restriction site-free region may contain a 
restriction site for an endonuclease for which the plasmid is not intended to be 
informative of digestion efficiency.

[0035] The term "fully distributed" means that each restriction site to be
monitored is separated on the plasmid from each of the other sites to be monitored 
by a restriction site-free spacer segment at least about 15%-85% of the length of 
the intact plasmid. In other words, for a plasmid to be informative of double 
digestion, for any two sites, the two sites must reside in different polylinkers on 
the plasmid. 

[0036] Inasmuch as the present invention relates to a specific type of plasmid for 
monitoring the efficiency of a restriction endonuclease digestion, that is, a 
"monitor plasmid," the term "non-monitor plasmid" refers to all plasmids other 
than those encompassed by the present invention. 

Plasmid Design 

[0037] The design of a plasmid or set of plasmids useful for monitoring the 
efficiency of a restriction endonuclease digestion must take into account the final 
criterion for an efficient double digestion, that is, the absence of the full-length 
linear form generated when only one of the enzymes cuts the plasmid. Hence, an 
important feature of the plasmid(s) of the present invention is that restriction sites 
for any two endonucleases be situated on the plasmid sufficiently distant from 
each other to ensure that, following digestion of the plasmid with those two 
enzymes, the fragments obtained are readily distinguishable from the intact 
plasmid. This enables any linear plasmid resulting from incomplete digestion to 
be easily detected. Generally, a fragment whose length is at least about 15% 
shorter than the length of the intact plasmid can be confidently distinguished from 
the linearized plasmid on an agarose gel. 
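
The size criterion described above can be expressed as a small calculation. The sketch below assumes a circular plasmid with cut positions given in base pairs; the function names are hypothetical, and the plasmid length and positions used in the demonstration are illustrative values, not figures from the specification.

```python
def double_digest_fragments(plasmid_length, pos_a, pos_b):
    """Lengths of the two fragments from cutting a circular plasmid at two positions."""
    for pos in (pos_a, pos_b):
        if not 0 <= pos < plasmid_length:
            raise ValueError("cut position outside the plasmid")
    if pos_a == pos_b:
        raise ValueError("the two cut positions must differ")
    arc = abs(pos_a - pos_b)
    return arc, plasmid_length - arc


def is_informative(plasmid_length, pos_a, pos_b, min_shortening=0.15):
    """True if the larger fragment is at least `min_shortening` (as a fraction)
    shorter than the full-length linear plasmid."""
    larger = max(double_digest_fragments(plasmid_length, pos_a, pos_b))
    return larger <= (1.0 - min_shortening) * plasmid_length


if __name__ == "__main__":
    # Illustrative 3000 bp plasmid: sites 600 bp apart versus 150 bp apart.
    print(is_informative(3000, 100, 700))
    print(is_informative(3000, 100, 250))
```

Two sites that lie close together, as within a single polylinker, give a large fragment almost as long as the linearized plasmid, which is the case the design is meant to avoid.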






[0038] In designing a plasmid or plasmid set useful for monitoring the efficiency 
of digestion by pairs of restriction enzymes, restriction sites are assigned to one of 
two or more polylinker regions of a plasmid. For each plasmid or plasmid set,
there is a limited number of ways of distributing a plurality of restriction sites to
achieve full distribution. There are, however, a number of different sets that can be
designed according to the criteria defined herein. That is, there is no unique
distribution, but the manner of distribution is uniquely defined. A convenient
method of distributing the sites for any particular design is by reference to a
template specific for each number of polylinkers (a) and plasmids (b).
Accordingly, for each number of polylinkers and plasmids there is a different 
template. Mathematically, the positions in the template correspond to the unique 
positions in an a x b matrix. Hence all possible fully informative distributions can 
be represented in terms of the appropriate template. 

[0039] All possible distributions, therefore, can be represented as a template in 
which the N positions are numbered according to their distribution in the plasmid 
set (for example, see Tables 1-9). Toward this end, plasmids are designed 
according to basic mathematical distributions (for example, a 2 x 4 matrix for four 
plasmids each having two polylinkers, a 3 x 3 matrix for three plasmids each 
having three polylinkers, a 3 x 4 matrix for four plasmids each having three 
polylinkers, a 4 x 3 matrix for three plasmids each having four polylinkers etc.) so 
that an optimal number of restriction sites is distributed to polylinker regions of a 
plasmid or set of plasmids such that for any two restriction sites represented, the 
two restriction sites reside within different polylinkers on a plasmid separated by a 
spacer segment.

[0040] The polylinkers, preferably three or four per plasmid, are separated from
each other by a spacer segment of restriction site-free nucleic acid so that digestion 
of the resulting plasmid with two restriction endonucleases represented in the 
polylinkers, generates two fragments, of which the larger is at least about 15% 
shorter in length than the intact plasmid so that it can be readily distinguished 
from intact linear plasmid. 

[0041] Limitations on the size of the polylinker regions are related to practical
considerations, for example, practicable plasmid size and the number of
polylinkers that will fit while still satisfying the requirement that neighboring
polylinkers on the plasmid be separated by a spacer segment at least about 15% of
the length of the intact plasmid. The maximum number of sites which can be fully
distributed for any given plasmid or set of plasmids is a function of the number of
polylinker sites in each plasmid and the number of plasmids. That number, N, is
equal to a^b, where a is the number of polylinkers per plasmid and b is the total
number of plasmids to which the sites are distributed. 
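
The relation N = a^b can be motivated by a short counting argument, offered here as a sketch rather than as part of the original disclosure. The label c_j(s), denoting the polylinker that carries site s on plasmid j, is notation introduced only for this illustration.

```latex
\begin{align*}
  &\text{Each site } s \text{ is described by the vector }
    \bigl(c_1(s), \dots, c_b(s)\bigr), \quad c_j(s) \in \{1, \dots, a\}. \\
  &\text{Sites } s \neq t \text{ can be monitored as a pair}
    \iff \exists\, j : c_j(s) \neq c_j(t). \\
  &\text{Hence all sites need distinct vectors, and } N \le a^{b}. \\
  &a = 4,\ b = 2 \Rightarrow N = 16; \qquad
   a = 3,\ b = 3 \Rightarrow N = 27; \qquad
   a = 4,\ b = 3 \Rightarrow N = 64.
\end{align*}
```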

[0042] It is not necessary, however, that a restriction site be assigned to every 
position in the template. In some designs it may not be desirable to include the 
maximum number of restriction sites in the plasmid set. In this case the
distribution is merely a mathematically "degenerate" form of the full distribution,
and more enzyme sites can be added later to unoccupied template positions.

[0043] Conversely, it is possible to assign more than one site to the same template
position. In the resulting plasmid set, however, it would not be possible to test for 
digestion by the two corresponding enzymes, but each could still be tested pair- 
wise with all the other enzymes. 

[0044] The actual choice of assignment for sites is arbitrary, provided that each 
site is placed according to the template defined by the a x b matrix for a particular 
set of plasmids and the sites are fully distributed. Hence, a large number of site 
distributions are possible for the purpose of monitoring double digestion. 

[0045] Examples of some of the embodiments intended to be encompassed by the
instant invention are provided. The examples given are meant to be illustrative of
possible combinations of sites to be represented, and the invention is in no way
intended to be limited to the particular arrangement of sites shown.

[0046] In one embodiment, a set of two plasmids each comprising three polylinker 
sites would be informative for at least nine different restriction sites; a set of two 
plasmids each comprising four polylinker sites, as represented by Table 1, 
illustrates a distribution scheme in which 16 restriction sites can be represented to 
ensure that, for any two restriction sites represented (i.e., any of the 240 possible 
combinations), in at least one of the two plasmids, the two restriction sites to be 
monitored reside within different polylinkers.

[0047] In each case, the number of informative sites can be increased slightly by
including linked sites and/or by including additional restriction sites in the vector 
backbone of the plasmid. For example, for the plasmid design shown in Table 1 
below, which shows one possible combination of 16 sites to be represented, the 
inclusion of 6 linked sites, that is, recognition sites contained within other
recognition sites (template numbers 1, 4, 6-9), increases the number of informative
sites to 22. The advantage of this design is that only two plasmids are needed to
monitor 478 of the 484 possible combinations of 22 frequently used enzymes. In
this design, however, sites that are linked (e.g. EcoRI-MfeI) are both in the same
polylinker (Polylinker A) in both plasmids and therefore, cannot be tested for 
double digestion. 



Table 1

Two plasmids with four polylinkers: 16 distributed sites plus 6 linked sites

                      Plasmid 1                                     Plasmid 2
                      Polylinker                                    Polylinker
A              B               C              D               A     B     C     D
1 EcoRI-MfeI   5 AgeI          9 SalI-XhoI    13 HindIII      1     2     3     4
2 SacI         6 BamHI-BglII   10 MluI        14 PstI         5     6     7     8
3 KpnI         7 ClaI-BstBI    11 SacI        15 NotI(EagI)   9     10    11    12
4 SmaI-NruI    8 XbaI-SpeI     12 SphI        16 BspHI        13    14    15    16

[0048] A set of three plasmids each having three polylinkers can be informative 
for 27 (or more) restriction sites. Preliminarily, a list of 27 restriction sites was 
ordered from 1 to 27, then partitioned into subsets of three sites and assigned to 
the first, second, and third polylinker positions (see Table 2). For example, for 
the first plasmid of the set (Plasmid 1), the first three groups, corresponding to the
first nine sites, were assigned to the first polylinker position (A), the next three
groups of three sites, corresponding to the next nine sites, were assigned to the
second polylinker position (B) and the last three groups of three sites,
corresponding to the last nine sites, were assigned to the third polylinker position
(C). 

[0049] For the second plasmid (Plasmid 2), the first, fourth, and seventh groups of
three restriction sites (that is, sites 1, 2, 3, 10, 11, 12, 19, 20 and 21) are assigned
to the first polylinker position (A), the second, fifth, and eighth groups of three
sites to the second polylinker position (B), and the third, sixth, and ninth groups of
three sites to the third polylinker position (C). For the third plasmid of the set
(Plasmid 3), every third restriction site from the list of 27, starting with site number
1 (i.e., 1, 4, 7, ...), is assigned to the first polylinker position (A), every third site
starting with site number 2 (2, 5, 8, ...) to the second polylinker position (B) and
every third site starting with site number 3 to the third polylinker position (C).
This partitioning of restriction sites among the three plasmids corresponds to a 
three-by-three matrix, in mathematical terms. 
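
The partitioning described in the two preceding paragraphs can be generated mechanically by writing each site number, less one, as a b-digit number in base a and letting each plasmid group the sites by one digit. The sketch below is one way to do this under that assumption; `build_template` is a hypothetical name, and other valid templates exist.

```python
def build_template(a, b):
    """Return template[p][q]: the site numbers in polylinker q of plasmid p.

    Site n (1-based) is written as a b-digit number in base a. Plasmid p
    (0-based) groups the sites by digit p, most significant digit first,
    so two distinct sites always differ on at least one plasmid.
    """
    n_sites = a ** b
    template = [[[] for _ in range(a)] for _ in range(b)]
    for site in range(1, n_sites + 1):
        value = site - 1
        digits = []
        for _ in range(b):
            digits.append(value % a)
            value //= a
        digits.reverse()  # most significant digit first
        for plasmid, digit in enumerate(digits):
            template[plasmid][digit].append(site)
    return template


if __name__ == "__main__":
    for p, plasmid in enumerate(build_template(3, 3), start=1):
        for label, sites in zip("ABC", plasmid):
            print(f"Plasmid {p}, polylinker {label}: {sites}")
```

For a = 3 and b = 3 this reproduces the three groupings described above: blocks of nine consecutive sites, alternating groups of three, and every third site.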

[0050] Table 2 provides a template for distribution of 27 sites to a 3 plasmid 
(three polylinker/plasmid) set.

Table 2

Template for 3 plasmids with 3 polylinkers (27 fully distributed sites)

    Plasmid 1             Plasmid 2             Plasmid 3
  Polylinker site       Polylinker site       Polylinker site
  A     B     C         A     B     C         A     B     C
  1     10    19        1     4     7         1     2     3
  2     11    20        2     5     8         4     5     6
  3     12    21        3     6     9         7     8     9
  4     13    22        10    13    16        10    11    12
  5     14    23        11    14    17        13    14    15
  6     15    24        12    15    18        16    17    18
  7     16    25        19    22    25        19    20    21
  8     17    26        20    23    26        22    23    24
  9     18    27        21    24    27        25    26    27

An example of one possible restriction site distribution is shown in Table 3.

Table 3

3 plasmids with 3 polylinkers (27 fully distributed sites)

pDM1a

Polylinker A    template #   Polylinker B         template #   Polylinker C     template #
HindIII         1            XbaI                 10           SacI             19
AflII           2            BglII                11           NdeI             20
SphI            3            MluI                 12           XhoI             21
AgeI            4            StuI/MunI            13           MscI/ApaI/SmaI   22
BstEII/NgoMIV   5            ClaI                 14           NcoI             23
KpnI/NotI       6            EcoRV                15           NheI             24
EcoRI           7            PstI                 16           AvrII            25
SpeI            8            SalI/HpaI(HincII)    17           NarI             26
BstBI           9            AscI-BssHII          18           BamHI            27

pDM2a

Polylinker A    template #   Polylinker B         template #   Polylinker C        template #
HindIII         1            AgeI                 4            EcoRI               7
AflII           2            BstEII/NgoMIV        5            SpeI                8
SphI            3            KpnI/NotI            6            BstBI               9
XbaI            10           StuI/MunI            13           PstI                16
BglII           11           ClaI                 14           SalI/HpaI(HincII)   17
MluI            12           EcoRV                15           AscI-BssHII         18
SacI            19           MscI/ApaI/SmaI       22           AvrII               25
NdeI            20           NcoI                 23           NarI                26
XhoI            21           NheI                 24           BamHI               27

pDM3a

Polylinker A     template #   Polylinker B         template #   Polylinker C   template #
HindIII          1            AflII                2            SphI           3
AgeI             4            BstEII/NgoMIV        5            KpnI/NotI      6
EcoRI            7            SpeI                 8            BstBI          9
XbaI             10           BglII                11           MluI           12
StuI/MunI        13           ClaI                 14           EcoRV          15
PstI             16           SalI/HpaI(HincII)    17           AscI-BssHII    18
SacI             19           NdeI                 20           XhoI           21
MscI/ApaI/SmaI   22           NcoI                 23           NheI           24
AvrII            25           NarI                 26           BamHI          27



[0051] In another embodiment of the present invention, distributing the desired 
restriction sites among four polylinker regions in each of three plasmids boosts 
the number of sites that can be represented to 64 (i.e., 4^3). In an example of this 
design option there are four polylinker sites: three located at the ends of two 
spacer fragments as in the pDM series, and a fourth in the BspHI site between the 
β-lactamase and ORI regions of the pUC backbone. The advantage of this 
arrangement is that 64 fully distributed sites can be represented in a set of only 
three plasmids; for example, this could include the 57 known six-base cutter sites 
and the seven six-base palindromes interrupted by a single base (e.g., BstEII). For 
distribution in accordance with this embodiment, the following template would be 
used: 



Table 4 

Template for 3 plasmids with 4 polylinkers (64 fully distributed sites) 

   Plasmid 1           Plasmid 2           Plasmid 3
   Polylinker          Polylinker          Polylinker
 A   B   C   D       A   B   C   D       A   B   C   D
 1  17  33  49       1   5   9  13       1   2   3   4
 2  18  34  50       2   6  10  14       5   6   7   8
 3  19  35  51       3   7  11  15       9  10  11  12
 4  20  36  52       4   8  12  16      13  14  15  16
 5  21  37  53      29  17  21  25      17  18  19  20
 6  22  38  54      30  18  22  26      21  22  23  24
 7  23  39  55      31  19  23  27      25  26  27  28
 8  24  40  56      32  20  24  28      29  30  31  32
 9  25  41  57      41  45  33  37      33  34  35  36
10  26  42  58      42  46  34  38      37  38  39  40
11  27  43  59      43  47  35  39      41  42  43  44
12  28  44  60      44  48  36  40      45  46  47  48
13  29  45  61      53  57  61  49      49  50  51  52
14  30  46  62      54  58  62  50      53  54  55  56
15  31  47  63      55  59  63  51      57  58  59  60
16  32  48  64      56  60  64  52      61  62  63  64



[0052] In another embodiment of the present invention, a set of four plasmids 
(referred to herein as pDM1-4) was constructed wherein each plasmid contained 
three polylinker regions. A four plasmid set containing three polylinkers per 
plasmid can be used to represent any number of sites up to 81 (i.e., 3^4). The 
template for this arrangement is given in Table 5. 

Table 5 

Template for distribution of 81 sites 

   pDM1            pDM2            pDM3            pDM4
 Polylinker      Polylinker      Polylinker      Polylinker
 A   B   C       A   B   C       A   B   C       A   B   C
 1  28  55       1  10  19       1   4   7       1   2   3
 2  29  56       2  11  20       2   5   8       4   5   6
 3  30  57       3  12  21       3   6   9       7   8   9
 4  31  58       4  13  22      10  13  16      10  11  12
 5  32  59       5  14  23      11  14  17      13  14  15
 6  33  60       6  15  24      12  15  18      16  17  18
 7  34  61       7  16  25      19  22  25      19  20  21
 8  35  62       8  17  26      20  23  26      22  23  24
 9  36  63       9  18  27      21  24  27      25  26  27
10  37  64      28  37  46      28  31  34      28  29  30
11  38  65      29  38  47      29  32  35      31  32  33
12  39  66      30  39  48      30  33  36      34  35  36
13  40  67      31  40  49      37  40  43      37  38  39
14  41  68      32  41  50      38  41  44      40  41  42
15  42  69      33  42  51      39  42  45      43  44  45
16  43  70      34  43  52      46  49  52      46  47  48
17  44  71      35  44  53      47  50  53      49  50  51
18  45  72      36  45  54      48  51  54      52  53  54
19  46  73      55  64  73      55  58  61      55  56  57
20  47  74      56  65  74      56  59  62      58  59  60
21  48  75      57  66  75      57  60  63      61  62  63
22  49  76      58  67  76      64  67  70      64  65  66
23  50  77      59  68  77      65  68  71      67  68  69
24  51  78      60  69  78      66  69  72      70  71  72
25  52  79      61  70  79      73  76  79      73  74  75
26  53  80      62  71  80      74  77  80      76  77  78
27  54  81      63  72  81      75  78  81      79  80  81


[0053] As discussed above, fewer than the maximum number of sites can be 
distributed in any template. One example, as shown in Table 6, is a distribution 
of 33 sites to a template suitable for possible distribution of 81 sites, i.e., a 
four-plasmid set in which each plasmid has three polylinkers. This is the pDM1-4 
set, a variation of the pDMu1-4a set shown in Table 8. The template position number 
(template #) for each site is shown next to the enzyme name. 

Table 6. pDM1-4 Series (33 polylinker sites) 
(# = template number) 

pDM1 

 #  Polylinker A       #  Polylinker B         #  Polylinker C
 1  AgeI              28  PstI                55  SalI (HincII)
 2  BstEII            30  AscI (BssHII)       56  HpaI (HincII)
 3  KpnI              31  MunI                61  SmaI
 7  HindIII           37  XbaI                67  SacI
 8  AflII             38  BglII               68  NdeI
 9  SphI              39  MluI                69  XhoI
11  NgoMIV            40  StuI                70  ApaI
12  NotI              41  ClaI                76  AvrII
25  EcoRI             42  EcoRV               77  NarI
26  SpeI                                      78  BamHI
27  BstBI                                     79  MscI
                                              80  NcoI
                                              81  NheI

pDM2 

 #  Polylinker A       #  Polylinker B         #  Polylinker C
 1  AgeI              37  XbaI                25  EcoRI
 2  BstEII            38  BglII               26  SpeI
 3  KpnI              39  MluI                27  BstBI
 7  HindIII           40  StuI                76  AvrII
 8  AflII             41  ClaI                77  NarI
 9  SphI              42  EcoRV               78  BamHI
28  PstI              67  SacI                79  MscI
30  AscI (BssHII)     68  NdeI                80  NcoI
31  MunI              69  XhoI                81  NheI
55  SalI (HincII)     70  ApaI
56  HpaI (HincII)     11  NgoMIV
61  SmaI              12  NotI

pDM3 

 #  Polylinker A       #  Polylinker B         #  Polylinker C
 1  AgeI              31  MunI                 7  HindIII
 2  BstEII            40  StuI                 8  AflII
 3  KpnI              41  ClaI                 9  SphI
11  NgoMIV            42  EcoRV               25  EcoRI
12  NotI              67  SacI                26  SpeI
28  PstI              68  NdeI                27  BstBI
30  AscI (BssHII)     69  XhoI                61  SmaI
37  XbaI              76  AvrII               70  ApaI
38  BglII             77  NarI                79  MscI
39  MluI              78  BamHI               80  NcoI
55  SalI (HincII)                             81  NheI
56  HpaI (HincII)

pDM4 

 #  Polylinker A       #  Polylinker B         #  Polylinker C
 1  AgeI               2  BstEII               3  KpnI
 7  HindIII            8  AflII                9  SphI
25  EcoRI             11  NgoMIV              12  NotI
28  PstI              26  SpeI                27  BstBI
31  MunI              38  BglII               30  AscI-BssHII
37  XbaI              41  ClaI                42  EcoRV
40  StuI              56  HpaI (HincII)       39  MluI
55  SalI (HincII)     68  NdeI                69  XhoI
61  SmaI              77  NarI                78  BamHI
67  SacI              80  NcoI                81  NheI
70  ApaI
76  AvrII
79  MscI
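
The choice of a four-plasmid template for 33 sites follows from the capacity of a template, which is the number of polylinkers per plasmid raised to the number of plasmids. A minimal sketch of that arithmetic is given here; the function name `min_plasmids` is hypothetical and is used only for illustration.

```python
def min_plasmids(n_sites, n_polylinkers=3):
    """Smallest number of plasmids whose template has room for n_sites.

    A set of N plasmids with k polylinkers each offers k**N template positions.
    """
    if n_sites < 1 or n_polylinkers < 2:
        raise ValueError("need at least one site and two polylinkers per plasmid")
    plasmids, capacity = 1, n_polylinkers
    while capacity < n_sites:
        plasmids += 1
        capacity *= n_polylinkers
    return plasmids
```

With three polylinkers per plasmid, three plasmids hold at most 27 sites, so 33 sites require a fourth plasmid, while 64 sites fit in three plasmids only when each carries four polylinkers.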







Expansion of pDM1-4 plasmid set to include 81 sites 

[0054] The arrangement illustrated in Table 6, above, is actually informative for 
43 sites. Thirty-three (33) sites are fully distributed within the polylinker regions 
of the plasmids and "informative" for monitoring double digestions. Polylinker 
sites are listed according to their template number, rather than in the order in 
which they are arranged in the polylinker. Additionally, there are 3 sites located 
in the vector backbone (BspHI, PvuI and FspI) that are fully informative and 7 other 
sites (ScaI, PvuII, AclI, PciI, ApaLI, AseI and DraI) in the vector backbone that are 
informative for nearly all combinations. HincII sites within SalI and HpaI are 
informative in this arrangement. 

[0055] The pDM series shown above in Table 6 can be expanded in three stages 
to include up to 81 (= 3^4) fully distributed sites. One possible version including all 
six-base palindromes and several other commonly used sites is shown in Table 8. 
Distribution of sites to the 81 available positions would be in accordance with the 
template shown in Table 5. Construction of pDM1-4 is described in detail below. 
Expansion of the pDM1-4 plasmid set could be accomplished as follows: 

[0056] (1) Insert new sites in polylinkers of pDM1-4. The thirty-three (33) 
positions already occupied in the template in the pDM1-4 set are shown in Table 
7. New sites to be inserted are assigned to one of the remaining 48 positions of 
the 81 possible in the pDM template (see Table 7). For example, the ApoI site is 
assigned #4 and included in polylinker A of pDM1, A of pDM2, B of pDM3, and 
A of pDM4, as dictated by the template. PmlI is assigned #5 and placed in 
polylinkers A, A, B, and B, respectively; etc. 

Table 7 

Template for distribution of 81 sites, pDMu1-4 

              pDMu1                       pDMu2           pDMu3           pDMu4
            Polylinker                  Polylinker      Polylinker      Polylinker
A            B            C             A   B   C       A   B   C       A   B   C
 1 AgeI      28 PstI      55 SalI       1  10  19       1   4   7       1   2   3
 2 BstEII    29           56 HpaI       2  11  20       2   5   8       4   5   6
 3 KpnI      30 AscI      57            3  12  21       3   6   9       7   8   9
 4           31 MunI      58            4  13  22      10  13  16      10  11  12
 5           32           59            5  14  23      11  14  17      13  14  15
 6           33           60            6  15  24      12  15  18      16  17  18
 7 HindIII   34           61 SmaI       7  16  25      19  22  25      19  20  21
 8 AflII     35           62            8  17  26      20  23  26      22  23  24
 9 SphI      36           63            9  18  27      21  24  27      25  26  27
10           37 XbaI      64           28  37  46      28  31  34      28  29  30
11 NgoMIV    38 BglII     65           29  38  47      29  32  35      31  32  33
12 NotI      39 MluI      66           30  39  48      30  33  36      34  35  36
13           40 StuI      67 SacI      31  40  49      37  40  43      37  38  39
14           41 ClaI      68 NdeI      32  41  50      38  41  44      40  41  42
15           42 EcoRV     69 XhoI      33  42  51      39  42  45      43  44  45
16           43           70 ApaI      34  43  52      46  49  52      46  47  48
17           44           71           35  44  53      47  50  53      49  50  51
18           45           72           36  45  54      48  51  54      52  53  54
19           46           73           55  64  73      55  58  61      55  56  57
20           47           74           56  65  74      56  59  62      58  59  60
21           48           75           57  66  75      57  60  63      61  62  63
22           49           76 AvrII     58  67  76      64  67  70      64  65  66
23           50           77 NarI      59  68  77      65  68  71      67  68  69
24           51           78 BamHI     60  69  78      66  69  72      70  71  72
25 EcoRI     52           79 MscI      61  70  79      73  76  79      73  74  75
26 SpeI      53           80 NcoI      62  71  80      74  77  80      76  77  78
27 BstBI     54           81 NheI      63  72  81      75  78  81      79  80  81



[0057] For construction of the plasmids, 12 oligonucleotides containing the new 
sites are synthesized and ligated into the 12 polylinker sites of the pDM1-4 
plasmids using methods well known to those of skill in the art. 

[0058] (2) Install new restriction site-free fragments. To make all the polylinker 
sites unique, the two restriction site-free segments in pDM1-4 would be replaced 
with fragments (as described above) generated by PCR from segments of genomic 
DNA lacking the new sites of interest. 

[0059] (3) Eliminate redundant sites from vector backbone. Some sites which 
would normally appear in the pUC vector but which are now included in the new 
polylinker regions are eliminated, using various standard methods well known to 
those of skill in the art. 

Table 8 

pDMu1-4 Expanded 4-plasmid set with 72 sites (and 9 open positions) 

                     pDMu1                               pDMu2           pDMu3           pDMu4
                   Polylinker                          Polylinker      Polylinker      Polylinker
A               B                 C                    A   B   C       A   B   C       A   B   C
 1 AgeI         28 PstI-SbfI      55 SalI-HincII       1  10  19       1   4   7       1   2   3
 2 BstEII       29 BssHII-AscI    56 HpaI-HincII       2  11  20       2   5   8       4   5   6
 3 KpnI         30 AscI           57                   3  12  21       3   6   9       7   8   9
 4 ApoI         31 MunI           58 ApaLI*            4  13  22      10  13  16      10  11  12
 5 [taatta]*    32 NruI           59 [ttgcaa]          5  14  23      11  14  17      13  14  15
 6 AclI*        33 PvuI-SgfI      60 AseI*             6  15  24      12  15  18      16  17  18
 7 HindIII      34 BclI           61 SmaI-SrfI         7  16  25      19  22  25      19  20  21
 8 AflII        35 AfeI           62 BlpI              8  17  26      20  23  26      22  23  24
 9 SphI         36 [cgcgcg]       63 Bsu36I            9  18  27      21  24  27      25  26  27
10 PmlI         37 XbaI           64 RsrII            28  37  46      28  31  34      28  29  30
11 NaeI-FseI    38 BglII          65 SexAI            29  38  47      29  32  35      31  32  33
12 EagI-NotI    39 MluI           66 Sse8647I         30  39  48      30  33  36      34  35  36
13 AatII        40 StuI           67 SacI             31  40  49      37  40  43      37  38  39
14 SnaBI        41 ClaI           68 NdeI             32  41  50      38  41  44      40  41  42
15 PvuII*       42 EcoRV          69 XhoI             33  42  51      39  42  45      43  44  45
16 [tagcta]     43 FspI*          70 ApaI             34  43  52      46  49  52      46  47  48
17 SspI         44 ScaI*          71 SanDI            35  44  53      47  50  53      49  50  51
18 PciI*        45 BsiWI          72                  36  45  54      48  51  54      52  53  54
19 BspHI*       46 BsrGI          73 PmeI-DraI*       55  64  73      55  58  61      55  56  57
20 BspEI        47 [atatat]       74 SwaI-DraI*       56  65  74      56  59  62      58  59  60
21 SacII        48 [ctatag]       75                  57  66  75      57  60  63      61  62  63
22              49 BstZ17I        76 AvrII            58  67  76      64  67  70      64  65  66
23              50 PsiI           77 NarI             59  68  77      65  68  71      67  68  69
24              51 NsiI           78 BamHI            60  69  78      66  69  72      70  71  72
25 EcoRI        52                79 MscI             61  70  79      73  76  79      73  74  75
26 SpeI         53                80 NcoI             62  71  80      74  77  80      76  77  78
27 BstBI        54                81 NheI             63  72  81      75  78  81      79  80  81



[0060] Sites with asterisks are ordinarily present in the vector backbone and would 
need to be removed to be fully informative. Bracketed sequences indicate six-base 
palindromes for still undiscovered enzymes. Double sites represent eight-base 
cutter sites with internal six-base cutter sites; this applies to FseI, NotI, SbfI, 
AscI, SgfI, PmeI, SwaI, and SrfI. BstEII, BlpI, Bsu36I, RsrII, SexAI, Sse8647I, 
and SanDI are palindromes interrupted by a single nucleotide. 

[0061] DraI, contained within both SwaI and PmeI, is fully distributed in this 
arrangement. In the same way, so are the HincII sites, which are isoschizomeric 
with SalI and HpaI. 

[0062] Another example of a distribution of less than the maximum number of 
sites is given in Table 9. 

Table 9 

pDMy plasmid set (see Figures 1 and 4) 

pDMy1 

Polylinker A    Polylinker B    Polylinker C
HindIII         BglII           XhoI
AflII           XbaI            SacI
SphI            MluI            NdeI
NotI            ClaI            ApaI
NgoMIV          EcoRV           SmaI
BstEII          MunI            NheI
AgeI            StuI            MscI
KpnI            PstI            NcoI
SpeI            HpaI            NarI
BstBI           SalI            AvrII
EcoRI                           BamHI

pDMy2 

Polylinker A    Polylinker B    Polylinker C
HindIII         BglII           EcoRI
AflII           XbaI            BstBI
SphI            MluI            SpeI
NotI            ClaI            ApaI
NgoMIV          EcoRV           SmaI
BstEII          MunI            NheI
AgeI            StuI            MscI
KpnI            NdeI            NcoI
PstI            SacI            NarI
HpaI            XhoI            AvrII
SalI                            BamHI

pDMy3 

Polylinker A    Polylinker B    Polylinker C
BglII           BamHI           EcoRI
XbaI            AvrII           BstBI
MluI            NarI            SpeI
NotI            ClaI            ApaI
NgoMIV          EcoRV           SmaI
BstEII          MunI            NheI
AgeI            StuI            MscI
KpnI            NdeI            NcoI
PstI            SacI            SphI
HpaI            XhoI            AflII
SalI                            HindIII

pDMy4 

Polylinker A    Polylinker B    Polylinker C
HindIII         BglII           XhoI
AgeI            AflII           EcoRV
XbaI            ClaI            MluI
SacI            NgoMIV          BstBI
MunI            NarI            NheI
MscI            NcoI            NotI
SmaI            BstEII          SphI
ApaI            NdeI            KpnI
AvrII           HpaI            BamHI
StuI            SpeI
PstI            SalI
EcoRI







[0063] In this embodiment, sites are distributed in three polylinkers in each of four 
plasmids, as shown in Figure 1. In this plasmid set, some sites are "linked," that 
is, they are present in the same polylinker in all four plasmids (e.g., SalI and HpaI, 
or NgoMIV and BstEII). Hence, these plasmids cannot be used for testing double 
digestion with either of those two pairs of enzymes. A summary of which plasmids 
in the set can be used for any given pair of sites is tabulated in Figure 4; a dot 
represents linked sites. 
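
The notion of linked sites can be checked mechanically. The sketch below is illustrative only: it records, for a handful of the sites in Table 9, the polylinker each occupies in the four plasmids, and it assumes that a plasmid is usable for a given pair of enzymes whenever the two sites lie in different polylinkers of that plasmid. The names `LAYOUT`, `usable_plasmids` and `linked_pairs` are hypothetical.

```python
from itertools import combinations

# Polylinker occupied by each site in plasmids 1 to 4 of the set, read from
# Table 9. Only a few of the sites are included to keep the sketch short.
LAYOUT = {
    "HindIII": "AACA",
    "EcoRI": "ACCA",
    "SalI": "BAAB",
    "HpaI": "BAAB",
    "NgoMIV": "AAAB",
    "BstEII": "AAAB",
    "BamHI": "CCBC",
}


def usable_plasmids(site_a, site_b, layout=LAYOUT):
    """1-based numbers of the plasmids in which the two sites sit in different polylinkers."""
    pairs = zip(layout[site_a], layout[site_b])
    return [number for number, (a, b) in enumerate(pairs, start=1) if a != b]


def linked_pairs(layout=LAYOUT):
    """Pairs of sites that share a polylinker in every plasmid of the set."""
    return sorted(
        tuple(sorted(pair))
        for pair in combinations(layout, 2)
        if not usable_plasmids(pair[0], pair[1], layout)
    )
```

For this subset, `linked_pairs()` reports SalI with HpaI and NgoMIV with BstEII, the two examples named in the text, while `usable_plasmids("HindIII", "EcoRI")` shows that only the second plasmid of the set separates those two sites.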
Construction of Plasmids 



Suitable Vectors 

[0064] Due to the relatively high copy number of plasmids produced, plasmids of 
the pUC and pBS series are most preferred as vectors for the construction of the 
digestion monitor plasmids of the present invention. Other suitable plasmids, 
however, include, without limitation, other plasmids of bacterial origin, for 
example pBR322. 

Vector backbone 

[0065] A commonly used plasmid is selected to provide the vector backbone that 
contains the genetic elements needed for replication of the plasmid in host cells 
and for genetic selection in transformed cells. The vector backbone is kept as short 
as possible, and should not contain sites that are contained in the polylinkers of the 
digestion monitor, though these can usually be removed, if necessary, without 
affecting vector functions. A convenient starting vector is pUC19, although other 
vectors are equivalent for purposes of constructing digestion monitors. The 
segment between nucleotides # 474 and # 2501 of pUC19 is used for all the 
constructions described herein, with the polylinker and spacer segments assembled 
and inserted between these points. 

Construction of polylinkers 

[0066] Polylinkers are constructed, using techniques well known to the skilled 
artisan, by annealing together synthetic oligonucleotides comprising the desired 
recognition sites. These polylinker regions are then ligated into the vector 
backbone using techniques well known to those of skill in the art. 

Construction of spacer segments 

[0067] Spacer segments are generated by PCR amplification of restriction site-free 
DNA sequences, for example, sequences obtained from the published genome of 
S. cerevisiae or H. sapiens. During the amplification, restriction sites are added to 
the ends of the fragments so that they can be ligated into the vector backbone. The 
spacer segments are at least about 15% of the length of the completed plasmid, 
and are used to separate the polylinker regions of the plasmid. A plasmid that is 
around 4 kb will, therefore, contain restriction site-free regions at least about 600 
bp in length. 

[0068] Where more than one spacer segment is required for the construction of a 
plasmid, the nucleotide sequences of the spacer segments are preferably different, 
to avoid recombination and deletion events between the spacer segment 
sequences. 

Identification of restriction site-free segments 

[0069] In one embodiment of the present invention, as exemplified by the pDM1-4 
and pDMy1-4 sets (Tables 6 and 9, respectively), the restriction site-free spacer 
DNA segments chosen for insertion between the three polylinker sites were an 891 
bp and a 1020 bp segment located on Chromosome V of S. cerevisiae at 
nucleotide numbers (NT#) 104477-105368 and 108958-109978, respectively. 
These were identified by testing a selection of segments chosen at random from 
the S. cerevisiae sequence database, using DNasis software to locate restriction 
sites. For larger numbers of restriction sites a more systematic method is needed 
to identify restriction site-free segments. For example, to obtain spacer segments 
for the 72-site pDMu1-4 set (Table 8) it was necessary to obtain DNA segments 
lacking all of the 72 sites. Site-free fragments of ~1000 bp are statistically unlikely 
to occur even in the human genome, and are also difficult to search for. Hence, 
fragments of ~500 bp were identified, since these are several orders of magnitude 
more common in human DNA. Thus, to generate the 1000 bp spacer segments, 
pairs of 500 bp fragments can be amplified by PCR from a human BAC clone, 
ligated together, and inserted between the polylinkers of the pDM1-4 plasmids. 
These can be ligated and inserted into the restriction site-free positions of the 
backbone of the pDM1-4 set during construction. 

Identifying suitable spacer fragments 

[0070] Any nucleic acid sequence which contains none of the restriction sites 
represented in the polylinker regions of the plasmid may be used for the restriction 
site-free region. The following DNA fragments are examples of suitable 
restriction site-free segments for practicing the method of the present invention 
and have been derived from chromosome V of the human genome. Each was 
chosen by a procedure, as described below, which locates fragments lacking any of 
the 64 possible 6 bp palindromes. The "Gene Construction Kit" software was used 
to locate each of the possible standard 6 bp non-degenerate palindromic restriction 
sites (i.e., all 64 6 bp palindromes) in the 134,336 bp sequence contained in the 
BAC plasmid CIT-HSPC_235N17. The resulting list of sites, numbered by 
position in the sequence, was imported into Excel. The distances between sites 
were then listed in the Excel spreadsheet as fragments, ordered by size, and 
identified by positions in the sequence (from 1 to 134,336). The largest eight 
sequences are listed below, ranging from 459 to 639 bp. 
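
The search just described (locate every six-base palindrome, then rank the stretches between neighbouring sites by size) can also be sketched in code. The following is a minimal illustration in Python and is not the software used above; the function names are hypothetical, positions are 0-based rather than numbered from 1, and each reported stretch is taken to be the longest run between two neighbouring sites that contains no complete site.

```python
from itertools import product

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def six_bp_palindromes():
    """All 64 six-base sequences that equal their own reverse complement."""
    sites = []
    for half in product("ACGT", repeat=3):
        left = "".join(half)
        right = left.translate(_COMPLEMENT)[::-1]  # reverse complement of the left half
        sites.append(left + right)
    return sites


def site_starts(seq, sites):
    """0-based start positions of every occurrence of any site in seq."""
    seq = seq.upper()
    wanted = set(sites)
    return [i for i in range(len(seq) - 5) if seq[i:i + 6] in wanted]


def site_free_segments(seq, sites):
    """Return (length, start, end) of each site-free stretch, longest first.

    seq[start:end] is the longest stretch between two neighbouring sites
    (or a sequence end) that contains no complete six-base site.
    """
    bounds = [-1] + site_starts(seq, sites) + [len(seq) - 5]
    segments = []
    for prev, nxt in zip(bounds, bounds[1:]):
        begin, end = prev + 1, nxt + 5
        if end > begin:
            segments.append((end - begin, begin, end))
    return sorted(segments, reverse=True)
```

Applied to a BAC-sized sequence, the first few entries returned by `site_free_segments` would correspond to the largest candidate spacer fragments; restricting `sites` to the enzymes actually present in a given plasmid set relaxes the search accordingly.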
72394-73038 

T^J^T^AAAATAATAATGTTTTTACTTTTTTAAGTATTGGAAGTACCTTAGT^AATCATTGACTTTG 
CCCAACTGCCCCATTTTACAGTTGTGGAAACCGAGGTCATAGAATTGGTAAATTACGAAGAAAACT 
GTGTTTCCTAGTAGGTCTCCCTGCCTTCACTCTGCCTCCACCACCAGGAGTCTCTGCACCCAGTCA 
TCTTTCTAAGGGGTGTGCCATCCATCACTCCCCACGCAAAACCTCTCTGTGCCCTCCAATGCAGCT 
AGACTGAAGCCTGTGATGTCAGTGTGGTCAGATCCCTGCAACCTCAGGTTGTGTTCATGGCACACT 
GTGCTTCTGCTACCCTGGCCTTCTCTCTGCTGTGTGAACACACTAAGGTTTAACCCTTCCCGTTGT 
CTTGGAGGGGAAAATTCTCCCAGATATTCAGGCTTCTTTGTGTCATTCAGTCTCACTCAGCTCAAA 
GGGCACTTCCTCTGGGCTGCCCTAACCCCCACCAGACACCCAAACTAGATGCACAACGCTCGTCAC 
TCTTTCCCATCAGCCTGATTGGATGCCTCTAAAGCATGTACCACTATCTGTACCACTGCTCATTTA 
TTCATTGTTCATGTTTATTGTTTGTGTCCAATCACTGACAGGAAGCTGCAG (SEQ ID NO. 1) 



133197-133749 

i^GC'rrTTAAGACAGAATCATAGACCATAAAACTATTTAATAAGTACAATGTACTAAAAATATCCT 
TTACTGAGACAGATTGGTGAAATTAAAAGAGCACCAGCTTAAAGATCAGGAAGCCAGATGCTGTTC 
TCTGCTCAGGCCGAGCCTCAATCATGTGGCCCTGGGCAGGCACCTCCTCAACTTGACCTCAGTTTT 
GCCCTTTTTACAATGGTATCTATAAGTTCTTCTTGGCTCTGCTATTCTGGAATTATCTTATGTAGA 
ATAAGTCTTCCCAAGCTGTGTGGGGCTTTTCCTGGTAGATTTGAGGGAAGTTTTGTTCTGTTTTGT 
TTTATTGTTTGCTTACCCTGCTACTGCCAGTGAAGTCAACACTACAAGCAGACAGTAAGCCAGGAA 
ACATTTCTCCCTGTCAGGTCAGCACATCCCATTAGGTGGATCTGGTGCTCAAGTTTATTAGATCAG 
GAGACCGATGCTGGGGAAGGCTCACAGAGTGGTCTGAGTCAGTTGCTCTACCATGAGGTTGGAGTG 
TAGTAGATTCAACATCCTGTGTACA (SEQ ID NO. 2) 

7~72o-762SC 

TCATGAGGTCAGGAGATTGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTACTAAAAATAC 
AAAAAATTAGCCGGGCGTGGTGGTGGGCACCTGTAGTCCCAACTACTCGGGAGGCTGAGGCAGGAG 
AATGGCATGAACCTGGGAGGCAGAGCTTGCAGTGAGCCTCAGATTGCACCACTGCACTCCAGCCTG 
GGCAACAGAATGAGACTCCATCTGAAAAAAAAAAAAAAACAACTTGTCCAAGTTTATATGGCCTGG 
TACAGGCAGGATTTAGGCAACTGAATCCACAGGTGCTGCTGAACCACCAGGAGACCTGCTGCCCAA 
GTAACTGCAAACGGGGCACCAGGGACTGTCTAGGACCACAGGTCGAGTTCCACCTCTGCCAACCTA 
GTTTGTTAGCCCTTCCTTCCTTCCTTCCTTCCTTCCTCCTTCCTTTTCTGTTCTTGCCTGCCAGCT 
TTCCTCTTTCTACATTTCTTCCTCCTCCTTCCTTCTCTCTGTCTCTTCTATCCTACATTAAGTATA 
AGTTTTTCATGA (SEQ ID NO. 3) 

94154-94684 

CTGCAGCAGGCATCTCTTGGTGCCATGAGTCCTGGTGGGCCTTATTGATTCCTTATTCATTTCTTA 
TCACCCCATGTGAATCAGATTCAGTTGCTTCACATTTTCTTCACTGCTCTTATCACTGCCCGATAT 
TATATTACAGTTGTGAGTTTTGCACCTCTTATATTAAGACAGTGTCTGCCACATAGTAAGCACTTA 
GTATTTGCTGAAAGTTGTAAAAGTGCATCAATGAGTATCCCACAGTGCCGGGCACATAATAGATAT 
TCCATAAATTGTTGTAAAATAGCATTTCCTCTCTGTCCAGGGAACAGGGATGAGGGTGGTATAAAT 
GGGGAGCATTTTGTTCAGGGATGTTTTCTGGATGTGGCATTTGAGCTAGACCTTAAAAGATGGAGT 
ACAATTCCACAAGGAAGGCTTAGTAGTTGGGCATTCCAAACAAAAAGGACAGGTGTTTAGACATGG 
AAAGCATTAGGGACATTTGAGAAATTGGGAATAGGTCAGTTTAGCTGGCAGACTGGAAATACAAAG 
AGGAATTC (SEQ ID NO. 4) 

111084-111592 

AGTAGTTATTACATTTTAGGTCTTATTTAATTGTGTGTGTTCCTCTCCATTCCCACCAACTAGGGC 
CTGACACATAGGGGTTCAATAATTGTCAAGTGATTGACAGAATGAATGAATGGATGGATGAGTGAA 
AAAGTCTCTCGATTTCCAGTGTGTATTCTCTCTAATATCTTCTACATTCTACACTGAAATTGTCTT 
TTTGAAAGCCTGGACTTCTTCAGTGGCTTGTCATTGCCAGTGGATAAAATGCAGACTTTTCATCTG 
TGCATTCAAGAACTACCACATATAGTCTCAGCCTACCATTTCTCTTTTTTTTTTTTTTTAGATGGA 
GCCTTTCTCTGCTGCCCAGGCTGGAGTGCAGTGGCATGGTCTCGGTTCACTGCAACCTCTGCCTGT 
TGGGTGCAAGAGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACCCACCACCACA 
CCCAGCTAATTTTTGTATTTTTAGTAGAGATGGAGTTTCACCATGTTGGCCA (SEQ ID NO. 5) 

76528-77029 

GAATTCATGTTAATCCACATAGAGAATTGAGAGATCAGAGCCTTCAATAAATGCCAACCAACATTT 
ACAGTGTATTGTCTGTCAGCACAGTGCTGTGACAGTGCTGGGGATAAAACAACACAGACGCCAGGG 
CATCAGACCCTACCCCCACGGTTGCTTTATGTACTCTCAGGGCACAGGGACGCTTTCATTCTGTGC 
TCTTATCACACTGACTTGTGTCATCTCTTGACTGTCAGTCTCTCCCACTGAACTACAAACCTTTTG 
AGAGCAGAAGCCCTTTTTCTTTTATTGTTTTCTCAGCATTTCATATCCTATTGCACAAATCAGGAC 
TTGGCACATAATAGATGCTCCATAAGTAATGGTTGAATAAATGAATAAATACATAGTACCCGTATT 
AGTTTGCTAGGGCTTCCATAACAAAATACCACAGCCTGGGTGGCTTACACAATGGAATTTTATTTT 
CTCATGGTCTTGGAGGCTGGAAGTCCAAGATCAAGGTGCCTGCAG (SEQ ID NO. 6) 

36677-37159 

AAATTTACGCAGAAGCAGGATACAGAAAGCAGCCAGAAGTGGAGCAGCACCAGCCGGGGTGGGGAC 
CAAGACCTCCCACCCTCCTTGGTGTCTGTCTGCCTTTGAGGTCCCTCTGCCAAAGTGGCGCTCTGC 
TGAGCAAAGTAATCCCTCAGGGCACTCCAACTCTGAGACAGAATGATTTATAGCCCTGTTAATCCA 
CCAGGCTGTCAAAAACGGCCACATCAGCAGACATACACAGAGACATGCAGTCACATACACTCAGGA 
CAAAAAATAAGCCCCAGAAACTGCTTGAGCAAGCTCAGGCTTTTCTCTAGCACCTTCCTCATTTGT 
TCAAGTTTTTGCCATCTTTATTTTCATTTCCTGGTTTCTTCTCACATCCTTCCTCTTTTCCCCTCC 
AAGTTACTAAAAATTCTAACAATTTCTACTTTTACTTCTTTGTTGTTATCTGCGGCACATCGTTGC 
CCCACCCCCACTCTTGGTCGCTATAG (SEQ ID NO. 7) 



89707-90178 

TTGCAAGCTGGAGACCCAGAAAAACTGATAATGTAGTACAAGTCCAAACATTGGCAGACTTGAGAC 
CTGGGAAGAGCCAATGTTTTAGTTTAAGTCTGTCAGTAAAAAGAAACCAATGTGCCAGCTCAAAGG 
CAGACAAGCATGGGAATTTTGTATTATAGGGAGGAAGGTCAGCGTTTTTTTCCTATTCAGGTCTTC 
AATGGATTGGACGGGAACCATCCACATTAGGGAGGGCAATCTGCTTTACTTAGTCTCCCAAATCAA 
ATGTTAATCTCATCCAGAAATATCAGCACACACAACCTTAGAATAATGTCTGACCAAATGTCTGGG 
CACCCAATAGCTCAGTCAAGTTGACACATAAAATTAACCATCACTTGGCCGGACGCTGTGGCTCAC 
ACCTGTTATCCCAGCACTTTGGGTGGGTGGATCACCTGAGGTTCCTTAGGAATGTAAAGAGGTGAG 

CACTTAGAATCTAGA (SEQ ID NO. 8) 
5757-6216 

AAGCTTACAACTTGCTGGTGCTGTTTCCTGGGATTTACTCCCTGCTCTGGAGTATCTCTAAACCTT 
TGGAGGTGCTACCACATGCCCCACCCTCACCCCAGGTTTTGATGACCACGCCATCTTTTAGGAGTT 
CCCCTTATTTTTATATCCTCATCCTTTCTCTATCCATGCTCAGTCATGGTCAATGTGAAATCTGTG 
CTGTGGCTTTCCTGGCTTGGCTCTTCATAGATAAACCTATAAAGCCAAGATTTGAGAAATCTTTCT 
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAATAAAGTTGGTGTTCTTTTTTTTTTCTTTAGCAA 
ATTGGCAACATTTCCTATCAGATTATGTATTGTTCACAGGCTATACAAACTCTAGGAACTATCAGG 
GGTTATTTGGAAGAAAAACAACTGGTGTACTATTTTTCTAAATACTACACCTACAAAACATCCAAT 
ATT (SEQ ID NO. 9) 

[0071] Once suitable restriction site-free sequences have been identified, two or 
more may be ligated, using techniques well known to those of skill in the art, to 
obtain a single site-free sequence of the appropriate length, that is, at least about 
15% of the length of the completed plasmid, to separate the polylinker regions of 
the plasmid. A plasmid that is around 4 kb will, therefore, contain restriction 
site-free regions at least about 600 bp in length. 
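
The length requirement is simple arithmetic: 15% of 4,000 bp is 600 bp. A small sketch of the check follows; the function names are hypothetical, and the 15% figure is passed as an integer percentage so that the calculation stays in whole base pairs.

```python
def min_spacer_length(plasmid_bp, percent=15):
    """Smallest whole number of base pairs that is at least percent% of the plasmid length."""
    return -(-plasmid_bp * percent // 100)  # ceiling division on integers


def spacers_long_enough(plasmid_bp, spacer_lengths, percent=15):
    """True if every spacer meets the minimum length for a plasmid of plasmid_bp."""
    threshold = min_spacer_length(plasmid_bp, percent)
    return all(length >= threshold for length in spacer_lengths)
```

For a plasmid of about 4 kb, the 891 bp and 1020 bp yeast segments mentioned earlier both exceed the 600 bp minimum.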

EXAMPLE 1 

[0072] In one embodiment, a set of four digestion monitor plasmids, referred to 
herein as pDMy1-4, as shown in Table 9, was constructed as follows. 

[0073] The region between polylinkers 1 and 3 corresponds to the segment of 
pUC19 between the SspI and HindIII sites, containing the origin of replication and 
the ampicillin resistance gene (nt 474 to 2501, 2.03 kb). The regions between 
polylinkers 1 and 2 and polylinkers 2 and 3 are segments from the S. cerevisiae 
genome corresponding to chromosome V, nt 108955-109978 and nt 104477- 
105368 (1.02 and 0.89 kb, respectively). The plasmids also comprise three 
polylinker segments of about 50-60 nt each, as described. 

Construction of pDMy1 

[0074] pUC19 was linearized with SspI and HindIII, end-filled with Klenow 
polymerase and ligated to oligo RSB (see Table 10 below) to generate pUC-RSB. 
PCR fragments were amplified from two regions of Chromosome V (fragment A: 
nt 108955 to nt 109978; fragment B: nt 104477 to nt 105368) using the oligos 
1080-5' and 1080-3', and 898-5' and 898-3' (Table 10). Fragments A and B were 
digested with EcoRI and SalI, and SalI and BamHI, respectively, and inserted 
successively into the corresponding sites in pUC-RSB to generate pDCMa. An 
unexpected MfeI site (i.e., one not present in the published SGD sequence) was 
found in the B fragment and eliminated by digesting with MfeI followed by 
end-filling with Klenow and religating to generate pDCM. This plasmid was 
digested with EcoRI and HindIII and ligated to the p1-1 oligo, generating pDMy1-1. 
pDMy1-1 was cut with SalI and BglII and ligated to the p1-2 oligo, generating 
pDMy1-1,2. pDMy1-1 was digested with XhoI and BamHI and ligated to the p1-3 
oligo, generating pDMy1-1,3. pDMy1-1,2 and pDMy1-1,3 were each digested with 
EcoRI and BglII; the fragment containing the vector backbone of pDMy1-1,3 was 
ligated to the 1 kb EcoRI-BglII fragment of pDMy1-1,2, generating pDMy1a. An 
unexpected NdeI site in fragment B was eliminated by partial digestion with NdeI 
followed by Klenow end-filling and religation, generating pDMy1. 

Construction of pDMy4 

[0075] To create pDMy4, pDCM was cut with EcoRI and HindIII and ligated to 
the p4-1 oligo, generating pDMy4-1. pDMy4-1 was digested with SalI and BglII 
and ligated to the p4-2 oligo. The resulting pDMy4-1,2 was then cut with BamHI 
and XhoI and ligated to the p4-3 oligo, generating pDMy4a. The unlisted NdeI 
site was eliminated as described above, generating pDMy4. 

Construction of pDMy2 

[0076] To create pDMy2, pDMy1 was cut with EcoRI and SpeI and ligated to the 
p2-1 oligo, generating pDMy2-1. This plasmid was digested with SalI and PstI 
and ligated to the p2-2 oligo, generating pDMy2-1,2. This plasmid was partially 
digested with XhoI, then fully digested with NheI and ApaI and ligated to the 
p2-3 oligo. A plasmid isolate containing the oligo inserted at the correct position 
was identified and designated pDMy2. 

Construction of pDMy3 

[0077] To generate pDMy3, pDMy2 was digested with NcoI and BamHI and 
ligated to the p3-3 oligo, generating pDMy3-3. This plasmid was digested with 
BglII and MluI and ligated to the p3-2 oligo, generating pDMy3-2,3. This 
plasmid was partially digested with HindIII, then fully digested with NgoMIV 
and ligated to oligo p3-1. A plasmid isolate with the oligo in the correct position 
was identified and designated pDMy3-1,2,3-XbBg. To eliminate a superfluous 
XbaI site remaining from the earlier stages, the plasmid was partially digested with 
XbaI, end-filled with Klenow and religated. A plasmid isolate containing a single 
XbaI site in the correct position was identified and designated pDMy3-1,2,3-Bg. 
To replace a superfluous BglII site with an NruI site, this plasmid was partially 
digested with BglII and ligated to the Bgl IIx-Nru oligo. A plasmid isolate 
containing a BglII site in the correct position was identified and designated 
pDMy3. 

[0078] The sequence of each of the three multiple restriction site regions in each 
plasmid was confirmed by Sanger sequencing. 

Table 10 

Oligonucleotides used in construction of pDMy1-4 

Oligo      Sequence                                             SEQ ID NO.
RSB+       gaattcaaaagtcgacaaaaggatcc                           10
RSB-       ggatccttttgtcgacttttgaattc                           11
1080-5'    gagagaattctataaagcttcgtattccaatggggagc               12
1080-3'    gagagtcgacagaaaacattctctagggattacg                   13
898-5'     gagagtcgactataagatctccataccattgtagtattgactactaactac  14
898-3'     gagaggatcctatactcgagtagaaaaccagattttgcctc            15
p1-1+      agcttaagcatgcggccgccggcggtaaccggtaccactagttcg        16
p1-1-      aattcgaactagtggtaccggttaccgccggcggccgcatgctta        17
p1-2+      gatctgtctagacgcgtatcgatatcaattgaggcctgcagctgttaac    18
p1-2-      tcgacgttaacagctgcaggcctcaattgatatcgatacgcgtctagaca   19
p1-3+      gatcctaggcgccatggccagctagcccgggcccatatgagc           20
p1-3-      tcgagctcatatgggcccgggctagctggccatggcgcctag           21
p2-1+      ctagctgcagctgttaacgtcgac                             22
p2-1-      aattgtcgacgttaacagctgcag                             23
p2-2+      tatgagctcgaga                                        24
p2-2-      tcgatctcgagctcatatgca                                25
p2-3+      ctagcccgggcccactagttcgaat                            26
p2-3-      tcgaattcgaactagtgggcccggg                            27
p3-1+      agctagatctgtctagacgcgtcatg                           28
p3-1-      acgcgtctagacagatct                                   29
p3-2+      gatcggatcctaggcgc                                    30
p3-2-      cgcggcgcctaggatcc                                    31
p3-3+      gatcaagcttaagcatg                                    32
p3-3-      cgcatgcttaagctt                                      33
p4-1+      agcttaccggtctagagctcaattggccacccgggccctaggcctgcag    34
p4-1-      aattctgcaggcctagggcccgggtggccaattgagctctagaccggta    35
p4-2+      gatcttaagtatcgatgccggcgccatggtaaccatatgttaactag      36
p4-2-      tcgactagttaacatatggttaccatggcgccggcatcgatacttaa      37
p4-3+      tcgagatatcacgcgttcgaagctagcggccgcatgcggtaccagctg     38
p4-3-      gatccagctggtaccgcatgcggccgctagcttcgaacgcgtgatatc     39
Bgl x-Nru  gatcgtcgcgac                                         40
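
As an illustrative check only, the paired oligonucleotides listed in Table 10 can be compared computationally. The Python sketch below, which is not part of the described method, tests whether RSB+ and RSB- are exact reverse complements of one another and locates the EcoRI (gaattc), SalI (gtcgac) and BamHI (ggatcc) recognition sequences within RSB+. The function and variable names are hypothetical.

```python
# Translation table for complementing a lower-case DNA sequence.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lower-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences copied from Table 10.
rsb_plus = "gaattcaaaagtcgacaaaaggatcc"
rsb_minus = "ggatccttttgtcgacttttgaattc"

# Standard recognition sequences of the three enzymes.
sites = {"EcoRI": "gaattc", "SalI": "gtcgac", "BamHI": "ggatcc"}

is_duplex = reverse_complement(rsb_plus) == rsb_minus
site_positions = {name: rsb_plus.find(site) for name, site in sites.items()}

print("RSB+ and RSB- anneal as a blunt duplex:", is_duplex)
for name, pos in site_positions.items():
    print(f"{name} site starts at position {pos} of RSB+")
```

The two strands are exact reverse complements, and the three sites occur in the order EcoRI, SalI, BamHI, consistent with the successive insertion of fragments A and B at those sites in pUC-RSB.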



Maintenance and replication of plasmids 

[0079] The plasmids of the present invention additionally comprise a selectable 
marker and bacterial origin of replication. The plasmids are replicated by growth 
of E. coli cells which have been transformed with the plasmid by standard 
techniques. Growth conditions are in standard growth medium, for example, 
Luria Broth, containing the antibiotic for which the plasmid contains a resistance 
gene, usually, but not limited to, ampicillin. After cell harvest, plasmid DNA is 
purified by standard plasmid DNA preparations, for example, Qiagen (Valencia, 
CA) or Sigma (St. Louis, MO) miniprep techniques. Plasmids in solution are 
stored frozen or in a dry (lyophilized) state. Cells carrying plasmids are stored 
frozen in growth media or on agar plates at 4°C. 

Use of pDM digestion monitors 

[0080] DNA samples were digested with pairs of restriction enzymes in 
accordance with known techniques, alone and in the presence of pDM digestion 
monitors, and run on a 1% agarose gel. The results are shown in Figure 3. Lane 
1 contained plasmid YCpDAN1-lacZ alone digested with EcoRI and SalI. The 
12 kb plasmid contains an EcoRI site and a SalI site separated by 500 nucleotides. 
The 12 kb plasmid is not, therefore, readily resolved from the 11.5 kb fragment. 
Lane 2 contains the same digest but includes pDM1. Lane 3 contains the same 
digestion reagents plus pDM2. In this sample, the pDM2 fragments run as a 
doublet. λ phage DNA digested with BstEII was run in Lane 4 as a molecular 
weight marker. A PCR fragment amplified from the yeast ANB1 gene was run in 
Lane 5. This fragment contains an internal EcoRI site and a HindIII site at the 
5' end, contained in the primer. The PCR product was digested with EcoRI and 
HindIII. Lane 6 contains the same sample as Lane 5 with pDM1 added. As shown 
by Figure 3, the absence of linear pDM indicates double digestion. 

Linear Monitors 

[0081] The digestion monitor plasmids of the present invention can be used in 
linear form. The advantage of this format is that, in cases where one of the 
enzymes being tested fails to cut, it can be determined which one is not working 
simply by inspecting the gel pattern. To linearize the plasmid, a restriction site 
having a position far enough from the midpoint of the backbone to permit 
discrimination of gel migration patterns is chosen. So, for example, in one 
embodiment, the plasmids of the present invention are supplied pre-digested with 
BglI. BglI is chosen because the position of the unique BglI site is sufficiently far 
from the midpoint of the backbone to permit this discrimination. 

[0082] The following illustration (not to scale) depicts a plasmid containing 
restriction sites BglI, A, B and C. The plasmid has been linearized by digestion 
with BglI. 

BglI----A----B----C----------BglI 

When the linearized plasmid is cut at sites B and C, the following fragments are 
obtained: 

BglI----A----B  +  B----C  +  C----------BglI 

If, however, the B enzyme fails to cut, the resulting products would be: 

BglI----A----B----C  +  C----------BglI 

Similarly, if the C enzyme fails to cut, the resulting fragments are: 

BglI----A----B  +  B----C----------BglI 

The migration patterns of the two pairs of digestion products are easily 
distinguishable by gel electrophoresis, thereby not only indicating that the reaction 
was incomplete but also providing specific information with respect to which 
enzyme failed. 
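
The reasoning above can be illustrated with a short calculation. The Python sketch below is offered as an example only: the plasmid length and the positions of sites B and C, measured from one BglI end of the linearized plasmid, are hypothetical values chosen for illustration and are not taken from any pDM plasmid. It lists the fragment sizes expected for each outcome of a double digest, and also shows why the linearizing site should lie away from the midpoint of the backbone: when B and C sit symmetrically about the center of the linear molecule, the two single-failure patterns coincide.

```python
def fragment_sizes(length, cut_positions):
    """Sizes (bp) of the pieces of a linear molecule cut at the given positions."""
    bounds = [0] + sorted(cut_positions) + [length]
    return sorted((b - a for a, b in zip(bounds, bounds[1:])), reverse=True)


def failure_patterns(length, site_b, site_c):
    """Expected fragment sizes for each outcome of a B + C double digest."""
    return {
        "both cut": fragment_sizes(length, [site_b, site_c]),
        "B fails": fragment_sizes(length, [site_c]),
        "C fails": fragment_sizes(length, [site_b]),
        "neither cuts": fragment_sizes(length, []),
    }


# Hypothetical coordinates (bp from one BglI end of the linearized plasmid).
LENGTH = 4100
offset = failure_patterns(LENGTH, site_b=1200, site_c=2300)
# Hypothetical sites placed symmetrically about the midpoint of the molecule.
symmetric = failure_patterns(LENGTH, site_b=1500, site_c=2600)

for outcome, sizes in offset.items():
    print(f"{outcome:13s} {sizes}")
print("offset case distinguishable:   ", offset["B fails"] != offset["C fails"])
print("symmetric case distinguishable:", symmetric["B fails"] != symmetric["C fails"])
```

With the offset coordinates, failure of the B enzyme yields fragments of 2300 and 1800 bp, whereas failure of the C enzyme yields 2900 and 1200 bp, so the two failures give different gel patterns. In the symmetric case both failures yield 2600 and 1500 bp and cannot be told apart.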

[0083] The present invention also provides a kit for monitoring the efficiency of 
an endonuclease digestion. The kit includes one or more plasmids containing 
restriction sites of utility to the molecular biologist and instructions for use of the 
plasmids to monitor the efficiency of a digestion using the endonucleases whose 
sites are represented on the monitor plasmid. The instructions may include, in 
part, a table or chart, for example, one similar to that shown in Figure 4, which 
indicates the plasmid that will be informative for the two restriction endonucleases 
to be used. A dot at the intersection of two sites indicates pairs of restriction 
endonucleases for which none of the plasmids is informative, for example, where 
the two sites are linked and, therefore, are in the same polylinker in each of the 
four plasmids. 
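
The lookup performed with such a chart can be sketched in a few lines. The Python example below is illustrative only: the plasmid names and the assignment of sites to polylinkers are hypothetical and do not reproduce Figure 4 or the actual pDM plasmids. A plasmid is treated as informative for a pair of enzymes when it carries both sites and the sites lie in different polylinkers.

```python
# Hypothetical site map: plasmid -> {enzyme: polylinker number}.
# These assignments are invented for illustration; they are not from Figure 4.
SITE_MAP = {
    "monitor-A": {"EcoRI": 1, "HindIII": 1, "SalI": 2, "BamHI": 3, "XhoI": 3},
    "monitor-B": {"EcoRI": 1, "SalI": 1, "HindIII": 2, "BamHI": 3, "XhoI": 3},
}


def informative_plasmids(enzyme_a, enzyme_b, site_map=SITE_MAP):
    """Plasmids carrying both sites in different polylinkers."""
    hits = []
    for plasmid, sites in site_map.items():
        if enzyme_a in sites and enzyme_b in sites:
            if sites[enzyme_a] != sites[enzyme_b]:
                hits.append(plasmid)
    return hits


for pair in [("EcoRI", "HindIII"), ("EcoRI", "SalI"), ("BamHI", "XhoI")]:
    hits = informative_plasmids(*pair)
    print(pair, "->", hits if hits else "no informative plasmid (a dot in the chart)")
```

In this hypothetical map, the BamHI and XhoI sites share a polylinker in every plasmid, so the pair returns an empty list, which corresponds to a dot in a chart of the kind described.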

[0084] In another embodiment, the monitor plasmids might be supplied together 
with restriction endonucleases, and suitable reaction buffers, for the purpose of 
providing the purchaser with a means for testing the activity of enzymes in 
different combinations. 